Does Patent IR Profit from Linguistics or Maximum Query Length?
Abstract
In 2011, the University of Hildesheim and Chemnitz University of Technology participated jointly in the CLEF Intellectual Property Track. We focused on the prior art candidate search task, which was offered for the third time. Our group submitted seven runs, ranging from simple bag-of-words queries to linguistic phrases. The aim of our experiments was to examine the effectiveness of different query strategies. In particular, we wanted to evaluate the advantage of linguistic phrases over very long bag-of-words queries. Phrases were extracted using a dedicated extraction component developed at the University of Hildesheim.
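The contrast between the two query types can be illustrated with a minimal sketch. It is not the extraction component described in the abstract: the stopword list, the function names, the sample claim text, and the use of adjacent non-stopword word pairs as a stand-in for linguistic phrases are all assumptions made for illustration only.

```python
import re

# Tiny illustrative stopword list (an assumption, not the authors' list).
STOPWORDS = {"a", "an", "and", "the", "of", "for", "in", "is", "to", "with", "by", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words_query(text, max_terms=None):
    """Return unique non-stopword terms in first-occurrence order.

    max_terms caps the query length; None keeps every term (a "very long" query).
    """
    terms = []
    for tok in tokenize(text):
        if tok not in STOPWORDS and tok not in terms:
            terms.append(tok)
    return terms if max_terms is None else terms[:max_terms]


def phrase_query(text):
    """Return adjacent non-stopword word pairs.

    This is a crude, hypothetical substitute for linguistically
    motivated phrase extraction, used only to show the query shape.
    """
    toks = tokenize(text)
    phrases = []
    for left, right in zip(toks, toks[1:]):
        if left not in STOPWORDS and right not in STOPWORDS:
            phrase = f"{left} {right}"
            if phrase not in phrases:
                phrases.append(phrase)
    return phrases


# Invented sample text, not taken from the CLEF-IP collection.
claim = "A fuel injection valve for an internal combustion engine"
print(bag_of_words_query(claim))
print(phrase_query(claim))
```

In this sketch the bag-of-words query treats every remaining term independently, while the phrase query keeps neighbouring words together, which is the kind of difference the experiments set out to compare.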
Similar Resources
Model to Support Patent Retrieval in the Context of Innovation- Processes by Means of Dialogue and Information Visualisation
Innovations are an essential factor of competition for manufacturing companies in technical industries. Patent information plays an important role within innovation-processes and for human innovators working on innovations. Innovation-processes support the combination of cross-organisational spread information and resources from patent databases and digital libraries is necessary in order to ga...
Strategies for Effective Chemical Information Retrieval
We participated in the technology survey and prior art search subtasks of the TREC 2009 Chemical IR Track. This paper describes the methods developed for these two tasks. For the technology survey task, we propose a method that constructs highly structured queries to do retrieval on different fields of chemical patents and documents in a weighted way. The proposed method i) enriches these struc...
Query Terms Extraction from Patent Document for Invalidity Search
This paper describes our patent retrieval system, which participated in the NTCIR-5 Patent Retrieval Task, Document Retrieval Subtask. The main focus of our method is appropriate query expansion to improve recall. We extracted query terms from the topic claim, and expanded query terms extracted from sentences explained in the patent document including the topic claim. The explanation sentences wer...
Query Reformulation in Collaborative Information Retrieval
Information retrieval (IR) systems utilize user feedback for generating optimal queries with respect to a particular information need. However, the methods that have been developed in IR for generating these queries do not memorize information gathered from previous search processes, and hence cannot use such information in new search processes. Thus each new search process does not know anythi...
TREC Chemical IR Track 2009: A Distributed Dimensional Indexing Model for Chemical Patent Search
For the TREC-2009 Chemical IR Track, we explore development of a distributed information retrieval system based on a dimensional data model. The indexing model supports named entity identification and aggregation of term statistics at multiple levels of patent structure including individual words, sentences, claims, descriptions, abstracts, and titles. The system was deployed across 15 Amazon W...